Standardized network reconstruction of CHO cell metabolism 

Kieran Smallbone 



Manchester Centre for Integrative Systems Biology 

131 Princess Street, Manchester Ml 7DN, UK 

O . Ikieran . smallbone@manchester . ac . ukl 
(N 



m 



o 






Abstract 



< 

0^ ' We have created a genome-scale network reconstruction of Chinese hamster ovary (CHO) 

cell metabolism. Existing reconstructions were improved in terms of annotation standards, 
to facilitate their subsequent use in dynamic modelling. The resultant network is available 
from ChoNet ( ,http://cho.sf.net/| ). 



ChoNet 



The structure of metabolic networks can be determined by a reconstruction approach, using 
data from genome annotation, metabolic databases and chemical databases [1]. We built upon 
^ ■ an existing reconstruction of the metabolic network of CHO cells that was based on genomic 

and literature data (Selvarasu et al. [2]). This model contains 1065 genes, 1545 metabolic 
reactions, and 1218 unique metabolites. Use of in silico modelling allows characterisation 
internal metabolic behaviour during growth and non-growth phases [2]. 

(^ [ Selvarasu et al. suffers from the use of non-standard names and is not annotated with methods 

CO ' that are machine-readable. The model was thus updated according to existing community- 

driven annotation standards [3]. The reconstruction is described and made available in Sys- 
tems Biology Markup Language (SBML) (http://sbml.org/, [1]), an established community 
^ I XML format for the mark-up of biochemical models that is understood by a large number 

H ' of software applications. The network is available from ChoNet ( jhttp://cho. sf.net/ ). As 

- ■ ■ supplied, the model has an optimal growth rate of 0.0257 flux units. 

Annotation 

The highly-annotated network is primarily assembled and provided as an SBML file. Specific 
model entities, such as species or reactions, are annotated using ontological terms. These 
annotations, encoded using the resource description framework (RDF) [5j, provide the fa- 
cility to assign definitive terms to individual components, allowing software to identify such 
components unambiguously and thus link model components to existing data resources [6]. 
Minimum Information Requested in the Annotation of Models (MIRIAM, [7|) -compliant 
annotations have been used to identify components unambiguously by associating them with 



one or more terms from publicly available databases registered in MIRIAM resources [8]- 
Thus this network is entirely traceable and is presented in a computational framework. 

Six different databases are used to annotate entities in the network (see Tabled]). The Systems 
Biology Ontology (SBO) [9] is also used to semantically discriminate between entity types. 
Five different SBO terms are used to annotate entities in the network (see Table [2]). 



example 


identifier 


database 


ChoNet 

ChoNet 

cytosol 

N-methylhistamine 

1-oxidoreduct ase 

1-oxidoreduct ase 


10029 

22252269 

00:0005737 

CHEBI:29009 

1.1.99.1 

218865 


taxonomy 

pubmed 

obo.go 

chebi 

ec-code 

ncbigene 



Table 1: MIRIAM annotations used in the model. 
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SBO term 


interpretation 


cytosol 


290 


compartment 


N-methylhistamine 


247 


metabolite 


N-methylhistamine 


176] 


biochemical reaction 


AATRA20 


185 


transport reaction 


biomass objective function 


i397 


modelling reaction 



Table 2: SBO terms used in the model. 



Use 

We maintain the distinction between the CHO cell GEnome scale Network REconstruction 
(GENRE) [10] and its derived GEnome scale Model (GEM) [11] • This is important to dif- 
ferentiate between the established biochemical knowledge included in a GENRE and the 
modelling assumptions required for analysis or simulation with a GEM. A GENRE serves as 
a structured knowledge base of established biochemical facts, while a GEM is a model which 
supplements the established biochemical information with additional (potentially hypothet- 
ical) information to enable computational simulation and analysis [T2|. Reactions added to 
the GEM include the biomass objective function - a sink representing cellular growth - and 
hypothetical transporters. 

Three versions of the network are made available: 



<organism>_<version>.xml, a GEM for use in flux analyses, provided in Flux Balance 
Constraints (FBC) format J13j 

<organism>_<version>_cobra.xml, the same GEM network, provided in Cobra for- 
mat UM 



<organism>_<version>_recon.xml, a GENRE containing only reactions for which there 
is experimental evidence 



EcoliNet and YeastNet 

EcoliNet (http://ecoli.sf.net/) and YeastNet (http://yeast.sf.net/') are annotated metabolic 
network of Escherichia coli and Saccharomyces cerevisiae S288c, respectively, that are period- 
ically updated by a team of collaborators from various research groups. The three networks 
are structured identically to facilitate comparative studies. 
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